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GENE EXPRESSION MARKERS FOR RESPONSE TO EGFR INHIBITOR DRUGS 

Background of the Invention 
The present application claims the benefit under 35 U.S.C. 1 19(e) of the filing date of 
5 U. S. Application Serial No. 60/445,968 filed on February 6, 2003. 
Field of the Invention 

The present invention concerns gene expression profiling of tissue samples obtained 
from patients who are candidates for treatment with a therapeutic EGFR inhibitor. More 
specifically, the invention provides methods based on the molecular characterization of gene 
10 expression in paraffin-embedded, fixed cancer tissue samples, which allow a physician to 
predict whether a patient is likely to respond well to treatment with an EGFR inhibitor. 
Description of the Related Art 

Oncologists have a number of treatment options available to them, 
including different combinations of chemotherapeutic drugs that are characterized as 
15 "standard of care," and a number of drugs that do not carry a label claim for particular cancer, 
but for which there is evidence of efficacy in that cancer. Best likelihood of good treatment 
outcome requires that patients be assigned to optimal available cancer treatment, and that this 
assignment be made as quickly as possible following diagnosis. 

Currently, diagnostic tests used in clinical practice are single analyte, and therefore do 
20 not capture the potential value of knowing relationships between dozens of different markers. 
Moreover, diagnostic tests are frequently not quantitative, relying on immunohistochemistry. 
This method often yields different results in different laboratories, in part because the 
reagents are not standardized, and in part because the interpretations are subjective and cannot 
be easily quantified. RNA-based tests have not often been used because of the problem of 
25 RNA degradation over time and the fact that it is difficult to obtain fresh tissue samples from 
patients for analysis. Fixed paraffin-embedded tissue is more readily available. Fixed tissue 
has been routinely used for non-quantitative detection of RNA, by in situ hybridization. 
However, recently methods have been established to quantify RNA in fixed tissue, using RT- 
PCR. This technology platform can also form the basis for multi-analyte assays 
30 Recently, several groups have published studies concerning the classification of 

various cancer types by microarray gene expression analysis (see, e.g. Golub et ah, Science 
286:531-537 (1999); Bhattacharjae etal, Proc. Natl Acad. Sci USA 98:13790-13795 (2001); 
Chen-Hsiang et al, Bioinformatics 17 (Suppl. 1):S316-S322 (2001); Ramaswamy et al. f 
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Proc. Natl Acad. Set USA 98:15149-15154 (2001)). Certain classifications of human breast 
cancers based on gene expression patterns have also been reported (Martin et aL, Cancer Res. 
60:2232-2238 (2000); West et al, Proc. Natl. Acad. Sci. USA 98:11462-11467 (2001); Sorlie 
et al, Proc. Natl Acad. Set. USA 98:10869-10874 (2001); Yan et al., Cancer Res. 61:8375- 
5 8380 (2001)). However, these studies mostly focus on improving and refining the already 
established classification of various types of cancer, including breast cancer, and generally do 
not link the findings to treatment strategies in order to improve the clinical outcome of cancer 
therapy. 

Although modern molecular biology and biochemistry have revealed hundreds of 
10 genes whose activities influence the behavior of tumor cells, the state of their differentiation, 
and their sensitivity or resistance to certain therapeutic drugs, with a few exceptions, the 
status of these genes has not been exploited for the purpose of routinely making clinical 
decisions about drug treatments. One notable exception is the use of estrogen receptor (ER) 
protein expression in breast carcinomas to select patients to treatment with anti-estrogen 
15 drugs, such as tamoxifen. Another exceptional example is the use of ErbB2 (Her2) protein 
expression in breast carcinomas to select patients with the Her2 antagonist drug Herceptin® 
(Genentech, Inc., South San Francisco, CA). 

Despite recent advances, a major challenge in cancer treatment remains to target 
specific treatment regimens to pathogenically distinct tumor types, and ultimately personalize 
20 tumor treatment in order to optimize outcome. Hence, a need exists for tests that 
simultaneously provide predictive information about patient responses to the variety of 
treatment options. 

Summary of the Invention 
The present invention is based on findings of a Phase II clinical study of gene 
25 expression in tissue samples obtained from human patients with non-small cell lung cancer 
(NSCLC) who responded or did not respond to treatment with EGFR inhibitors. 

In one embodiment, the invention concerns a method for predicting the likelihood that 
a patient who is a candidate for treatment with an EGFR inhibitor will respond to such 
treatment, comprising determining the expression level of one or more prognostic RNA 
30 transcripts or their expression products in a cancer tissue sample obtained from the patient, 
wherein the prognostic transcript is the transcript of one or more genes selected from the 
group consisting of: STAT5A, STAT5B, WISP1, CKAP4, FGFR1, cdc25A, RASSF1, G- 
Catenin, H2AFZ, NME1, NRG1, BC12, TAGLN, YB-1, Src, IGF1R, CD44, DIABLO, 
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TIMP2, AREG, PDGFRa, CTSB, Hepsin, ErbB3 5 MTA1, Gus, and VEGF., wherein (a) over- 
expression of the transcript of one or more of STATS A, STAT5B, WISP1, CKAP4, FGFR1, 
cdc25A, RASSF1, G-Catenin, H2AFZ, NME1, NRG1, BC12, TAGLN, YB1, Src, IGF1R, 
CD44, DIABLO, TIMP2, AREG, PDGFRa, and CTSB, or the corresponding expression 
5 product, indicates that the patient is not likely to respond well to the treatment, and (b) over- 
expression of the transcript of one or more of Hepsin, ErbB3, MTA, Gus, and VEGF, or the 
corresponding expression product, indicates that the patient is likely to respond well to the 
treatment. 

The tissue sample preferably is a fixed, paraffin-embedded tissue. Tissue can be 

10 obtained by a variety of methods, including fine needle, aspiration, bronchial lavage, or 
transbronchial biopsy. 

In a specific embodiment, the expression level of the prognostic RNA transcript or 
transcripts is determined by RT-PCR. In this case, and when the tissue sample is fixed, and 
paraffin-embedded, the RT-PCR amplicons (defined as the polynucleotide sequence spanned 

15 by the PCR primers) should preferably be less than 100 bases in length. In other 
embodiments, the levels of the expression product of the prognostic RNA transcripts are 
determined by other methods known in the art, such as immunohistochemistry, or proteomics 
technology. The assays for measuring the prognostic RNA transcripts or their expression 
products may be available in a kit format. 

20 In another aspect, the invention concerns an array comprising polynucleotides 

hybridizing to one or more of the following genes: STATS A, STAT5B, WISP1, CKAP4, 
FGFR1, cdc25A, RASSF1, G-Catenin, H2AFZ, NME1, NRG1, BC12, TAGLN, YB1, Src, 
IGF1R, CD44, DIABLO, TIMP2, AREG, PDGFrA, CTSB, Hepsin, ErbB3, MTA, Gus, and 
VEGF, immobilized on a solid surface. The polynucleotides can be cDNA or 

25 oligonucleotides. The cDNAs are typically about 500 to 5000 bases long, while the 
oligonucleotides are typically about 20 to 80 bases long. An array can contain a very large 
number of cDNAs, or oligonucleotides, e.g. up to about 330,000 oligonucleotides. The solid 
surface presenting the array can, for example, be glass. The levels of the product of the gene 
transcripts can be measured by any technique known in the art, including, for example, 

3 0 immunohistochemistry or proteomics. 

In various embodiments, the array comprises polynucleotides hybridizing to two at 
least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at 
least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen, at 
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least fifteen, at least seventeen, at least eighteen, at least nineteen, at least twenty, at least 
twenty-one, at least twenty-two, at least twenty-three, at least twenty-four, at least twenty- 
five, at least twenty-six, or all twenty-seven of the genes listed above. In a particular 
embodiment, hybridization is performed under stringent conditions. 
5 The invention further concerns a method of preparing a personalized genomics profile 

for a patient, comprising the steps of: 

(a) subjecting RNA extracted from cancer tissue obtained from the patient to gene 
expression analysis; 

(b) determining the expression level in the tissue of one or more genes selected 
10 from the group consisting of STATS A, STAT5B, WISP1, CKAP4, FGFrl, cdc25A, RASSF1, 

G-Catenin, H2AFZ, NME1, NRG1, BC12, TAGLN, YB1, Src, IGF1R, CD44, DIABLO, 
TIMP2, AREG, PDGFRA, CTSB, Hepsin, ErbB3, MTA, Gus, and VEGF, wherein the 
expression level is normalized against a control gene or genes and optionally is compared to 
the amount found in a corresponding cancer reference tissue set; and 
15 (c) creating a report summarizing the data obtained by said gene expression 

analysis. 

The invention additionally concerns a method for amplification of a gene selected 
from the group consisting of STATS A, STAT5B, WISP1, CKAP4, FGFrl, cdc25A, RASSF1, 
G-Catenin, H2AFZ, NME1, NRG1, BC12, TAGLN, YB1, Src, IGF1R, CD44, DIABLO, 
20 TIMP2, AREG, PDGFRA, CTSB, Hepsin, ErbB3, MTA, Gus, and VEGF by polymerase 
chain reaction (PCR), comprising performing said PCR by using a corresponding amplicon 
listed in Table 3, and a corresponding primer-probe set listed in Table 4. 

The invention further encompasses any PCR primer-probe set listed in Tables 4, and 
any PCR amplicon listed in Table 3. 
25 In yet another aspect, the invention concerns a prognostic method comprising: 

(a) subjecting a sample comprising cancer cells obtained from a patient to 
quantitative analysis of the expression level of the RNA transcript of at least one gene 
selected from the group consisting of STATS A, STAT5B, WISP1, CKAP4 5 FGFR1, cdc25A, 
RASSF1, G-Catenin, H2AFZ, NME1, NRG1, BC12, TAGLN, YB1, Src, IGF1R, CD44, 

30 DIABLO, TIMP2, AREG, PDGFRa, and CTSB, or their product, and 

(b) identifying the patient as likely to have a decreased likelihood of responding 
well to treatment with an EGFR inhibitor if the normalized expression levels of said gene or 
genes, or their products, are elevated above a defined expression threshold. 
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In a further aspect, the invention concerns a prognostic method comprising: 

(a) subjecting a sample comprising cancer cells obtained from a patient to 
quantitative analysis of the expression level of the RNA transcript of at least one gene 
selected from the group consisting of Hepsin, ErbB3, MTA, Gus, and VEGF or their product, 

5 and 

(b) identifying the patient as likely to have an increased likelihood of responding 
well to treatment with an EGFR inhibitor if the normalized expression levels of said gene or 
genes, or their products, are elevated above a defined expression threshold. 

Detailed Description of the Preferred Embodiment 

10 A. Definitions 

Unless defined otherwise, technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Singleton et aL, Dictionary of Microbiology and Molecular Biology 2nd ed., J. 
Wiley & Sons (New York, NY 1994), and March, Advanced Organic Chemistry Reactions, 

15 Mechanisms and Structure 4th ed., John Wiley & Sons (New York, NY 1992), provide one 
skilled in the art with a general guide to many of the terms used in the present application. 

One skilled in the art will recognize many methods and materials similar or equivalent 
to those described herein, which could be used in the practice of the present invention. 
Indeed, the present invention is in no way limited to the methods and materials described. For 

20 purposes of the present invention, the following terms are defined below. 

The term "microarray" refers to an ordered arrangement of hybridizable array 
elements, preferably polynucleotide probes, on a substrate. 

The term "polynucleotide," when used in singular or plural, generally refers to any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or 

25 modified RNA or DNA. Thus, for instance, polynucleotides as defined herein include, 
without limitation, single- and double-stranded DNA, DNA including single- and double- 
stranded regions, single- and double-stranded RNA, and RNA including single- and double- 
stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded 
or, more typically, double-stranded or include single- and double-stranded regions. In 

30 addition, the term "polynucleotide" as used herein refers to triple-stranded regions comprising 
RNA or DNA or both RNA and DNA. The strands in such regions may be from the same 
molecule or from different molecules. The regions may include all of one or more of the 
molecules, but more typically involve only a region of some of the molecules. One of the 
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molecules of a triple-helical region often is an oligonucleotide. The term "polynucleotide" 
specifically includes cDNAs. The term includes DNAs (including cDNAs) and RNAs that 
contain one or more modified bases. Thus, DNAs or RNAs with backbones modified for 
stability or for other reasons are "polynucleotides" as that term is intended herein. Moreover, 
5 DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as 
tritiated bases, are included within the term "polynucleotides" as defined herein. In general, 
the term "polynucleotide" embraces all chemically, enzymatically and/or metabolically 
modified forms of unmodified polynucleotides, as well as the chemical forms of DNA and 
RNA characteristic of viruses and cells, including simple and complex cells. 
10 The term "oligonucleotide" refers to a relatively short polynucleotide, including, 

without .limitation, single-stranded deoxyribonucleotides, single- or double-stranded 
ribonucleotides, RNA:DNA hybrids and double-stranded DNAs. Oligonucleotides, such as 
single-stranded DNA probe oligonucleotides, are often synthesized by chemical methods, for 
example using automated oligonucleotide synthesizers that are commercially available. 
15 However, oligonucleotides can be made by a variety of other methods, including in vitro 
recombinant DNA-mediated techniques and by expression of DNAs in cells and organisms. 

The terms "differentially expressed gene," "differential gene expression" and their 
synonyms, which are used interchangeably, refer to a gene whose expression is activated to a 
higher or lower level in a subject suffering from a disease, specifically cancer, such as breast 
20 cancer, relative to its expression in a normal or control subject. The terms also include genes 
whose expression is activated to a higher or lower level at different stages of the same 
disease. It is also understood that a differentially expressed gene may be either activated or 
inhibited at the nucleic acid level or protein level, or may be subject to alternative splicing to 
result in a different polypeptide product. Such differences may be evidenced by a change in 
25 mRNA levels, surface expression, secretion or other partitioning of a polypeptide, for 
example. Differential gene expression may include a comparison of expression between two 
or more genes or their gene products, or a comparison of the ratios of the expression between 
two or more genes or their gene products, or even a comparison of two differently processed 
products of the same gene, which differ between normal subjects and subjects suffering from 
30 a disease, specifically cancer, or between various stages of the same disease. Differential 
expression includes both quantitative, as well as qualitative, differences in the temporal or 
cellular expression pattern in a gene or its expression products among, for example, normal 
and diseased cells, or among cells which have undergone different disease events or disease 
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stages. For the purpose of this invention, "differential gene expression" is considered to be 
present when there is at least an about two-fold, preferably at least about four-fold, more 
preferably at least about six-fold, most preferably at least about ten-fold difference between 
the expression of a given gene in normal and diseased subjects, or in various stages of disease 
5 development in a diseased subject. 

The term "over-expression" with regard to an RNA transcript is used to refer the level 
of the transcript determined by normalization to the level of reference mRNAs, which might 
be all measured transcripts in the specimen or a particular reference set of mRNAs. 

The phrase "gene amplification 1 ' refers to a process by which multiple copies of a gene 

10 or gene fragment are formed in a particular cell or cell line. The duplicated region (a stretch 
of amplified DNA) is often referred to as "amplicon." Usually, the amount of the messenger 
RNA (mRNA) produced, i.e., the level of gene expression, also increases in the proportion of 
the number of copies made of the particular gene expressed. 

The term "prognosis" is used herein to refer to the prediction of the likelihood of 

15 cancer-attributable death or progression, including recurrence, metastatic spread, and drug 
resistance, of a neoplastic disease, such as non-small cell lung cancer, or head and neck 
cancer. The term "prediction" is used herein to refer to the likelihood that a patient will 
respond either favorably or unfavorably to a drug or set of drugs, and also the extent of those 
responses, or that a patient will survive, following surgical removal or the primary tumor 

20 and/or chemotherapy for a certain period of time without cancer recurrence. The predictive 
methods of the present invention can be used clinically to make treatment decisions by 
choosing the most appropriate treatment modalities for any particular patient. The predictive 
methods of the present invention are valuable tools in predicting if a patient is likely to 
respond favorably to a treatment regimen, such as surgical intervention, chemotherapy with a 

25 given drug or drug combination, and/or radiation therapy, or whether long-term survival of 
the patient, following surgery and/or termination of chemotherapy or other treatment 
modalities is likely. 

The term "long-term" survival is used herein to refer to survival for at least 1 year, 
more preferably for at least 2 years, most preferably for at least 5 years following surgery or 
3 0 other treatment. 

The term "increased resistance" to a particular drug or treatment option, when used in 
accordance with the present invention, means decreased response to a standard dose of the 
drug or to a standard treatment protocol. 
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The term "decreased sensitivity" to a particular drug or treatment option, when used in 
accordance with the present invention, means decreased response to a standard dose of the 
drug or to a standard treatment protocol, where decreased response can be compensated for 
(at least partially) by increasing the dose of drug, or the intensity of treatment. 
5 "Patient response" can be assessed using any endpoint indicating a benefit to the 

patient, including, without limitation, (1) inhibition, to some extent, of tumor growth, 
including slowing down and complete growth arrest; (2) reduction in the number of tumor 
cells; (3) reduction in tumor size; (4) inhibition (i.e., reduction, slowing down or complete 
stopping) of tumor cell infiltration into adjacent peripheral organs and/or tissues; (5) 

10 inhibition (i.e. reduction, slowing down or complete stopping) of metastasis; (6) enhancement 
of anti-tumor immune response, which may, but does not have to, result in the regression or 
rejection of the tumor; (7) relief, to some extent, of one or more symptoms associated with 
the tumor; (8) increase in the length of survival following treatment; and/or (9) decreased 
mortality at a given point of time following treatment. 

15 The term "treatment" refers to both therapeutic treatment and prophylactic or 

preventative measures, wherein the object is to prevent or slow down (lessen) the targeted 
pathologic condition or disorder. Those in need of treatment include those already with the 
disorder as well as those prone to have the disorder or those in whom the disorder is to be 
prevented. In tumor (e.g., cancer) treatment, a therapeutic agent may directly decrease the 

20 pathology of tumor cells, or render the tumor cells more susceptible to treatment by other 
therapeutic agents, e.g., radiation and/or chemotherapy. 

The term "tumor," as used herein, refers to all neoplastic cell growth and proliferation, 
whether malignant or benign, and all pre-cancerous and cancerous cells and tissues. 

The terms "cancer" and "cancerous" refer to or describe the physiological condition in 

25 mammals that is typically characterized by unregulated cell growth. Examples of cancer 
include but are not limited to, breast cancer, colon cancer, lung cancer, prostate cancer, 
hepatocellular cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer, liver 
cancer, bladder cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma, 
melanoma, head and neck cancer, and brain cancer. 

30 The "pathology" of cancer includes all phenomena that compromise the well-being of 

the patient. This includes, without limitation, abnormal or uncontrollable cell growth, 
metastasis, interference with the normal functioning of neighboring cells, release of cytokines 
or other secretory products at abnormal levels, suppression or aggravation of inflammatory or 
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immunological response, neoplasia, premalignancy, malignancy, invasion of surrounding or 
distant tissues or organs, such as lymph nodes, etc. 

The term "EGFR inhibitor" as used herein refers to a molecule having the ability to 
inhibit a biological function of a native epidermal growth factor receptor (EGFR). 
5 Accordingly, the term "inhibitor" is defined in the context of the biological role of EGFR. 
While preferred inhibitors herein specifically interact with (e.g. bind to) an EGFR, molecules 
that inhibit an EGFR biological activity by interacting with other members of the EGFR 
signal transduction pathway are also specifically included within this definition. A preferred 
EGFR biological activity inhibited by an EGFR inhibitor is associated with the development, 

10 growth, or spread of a tumor. EGFR inhibitors, without limitation, include peptides, non- 
peptide small molecules, antibodies, antibody fragments, antisense molecules, and 
oligonucleotide decoys. 

"Stringency 11 of hybridization reactions is readily determinable by one of ordinary 
skill in the art, and generally is an empirical calculation dependent upon probe length, 

15 washing temperature, and salt concentration. In general, longer probes require higher 
temperatures for proper annealing, while shorter probes need lower temperatures. 
Hybridization generally depends on the ability of denatured DNA to reanneal when 
complementary strands are present in an environment below their melting temperature. The 
higher the degree of desired homology between the probe and hybridizable sequence, the 

20 higher the relative temperature which can be used. As a result, it follows that higher relative 
temperatures would tend to make the reaction conditions more stringent, while lower 
temperatures less so. For additional details and explanation of stringency of hybridization 
reactions, see Ausubel et al., Current Protocols in Molecular Biology , Wiley Interscience 
Publishers, (1995). 

25 "Stringent conditions" or "high stringency conditions", as defined herein, typically: (1) 

employ low ionic strength and high temperature for washing, for example 0.015 M sodium 
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during 
hybridization a denaturing agent, such as formamide, for example, 50% (v/v) formamide with 
0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/5 OmM sodium 

30 phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42°C; or 
(3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium 
phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon 
sperm DNA (50 (ag/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 
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0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55°C, followed by a high- 
stringency wash consisting of 0.1 x SSC containing EDTA at 55°C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al., 
Molecular Cloning: A Laboratory Manual . New York: Cold Spring Harbor Press, 1989, and 
5 include the use of washing solution and hybridization conditions (e.g., temperature, ionic 
strength and %SDS) less stringent that those described above. An example of moderately 
stringent conditions is overnight incubation at 37°C in a solution comprising: 20% 
formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared 
10 salmon sperm DNA, followed by washing the filters in 1 x SSC at about 37-50°C. The 
skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary 
to accommodate factors such as probe length and the like. 

In the context of the present invention, reference to "at least one," "at least two,' 5 "at least 
five," etc. of the genes listed in any particular gene set means any one or any and all 

15 combinations of the genes listed. 

The terms "expression threshold," and "defined expression threshold" are used 
interchangeably and refer to the level of a gene or gene product in question above which the 
gene or gene product serves as a predictive marker for patient survival without cancer 
recurrence. The threshold is defined experimentally from clinical studies such as those 

20 described in the Example below. The expression threshold can be selected either for 
maximum sensitivity, or for maximum selectivity, or for minimum error. The determination 
of the expression threshold for any situation is well within the knowledge of those skilled in 
the art. 

B. Detailed Description 

25 The practice of the present invention will employ, unless otherwise indicated, 

conventional techniques of molecular biology (including recombinant techniques), 
microbiology, cell biology, and biochemistry, which are within the skill of the art. Such 
techniques are explained fully in the literature, such as, "Molecular Cloning: A Laboratory 
Manual", 2 nd edition (Sambrook et al, 1989); "Oligonucleotide Synthesis" (M.J. Gait, ed., 

30 1984); "Animal Cell Culture" (R.I. Freshney, ed., 1987); "Methods in Enzymology" 
(Academic Press, Inc.); "Handbook of Experimental Immunology", 4 th edition (D.M. Weir & 
C.C. Blackwell, eds. 5 Blackwell Science Inc., 1987); "Gene Transfer Vectors for Mammalian 
Cells" (J.M. Miller & M.P. Calos, eds., 1987); "Current Protocols in Molecular Biology" 
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(F.M. Ausubel et al., eds., 1987); and "PGR: The Polymerase Chain Reaction", (Mullis et al., 
eds., 1994). 

1. Gene Expression Profiling 

In general, methods of gene expression profiling can be divided into two large groups: 
5 methods based on hybridization analysis of polynucleotides, and methods based on 
sequencing of polynucleotides. The most commonly used methods known in the art for the 
quantification of mRNA expression in a sample include northern blotting and in situ 
hybridization (Parker & Barnes, Methods in Molecular Biology 106:247-283 (1999)); RNAse 
protection assays (Hod, Biotechniques 13:852-854 (1992)); and reverse transcription 

10 polymerase chain reaction (RT-PCR) (Weis et al, Trends in Genetics 8:263-264 (1992)). 
Alternatively, antibodies may be employed that can recognize specific duplexes, including 
DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. 
Representative methods for sequencing-based gene expression analysis include Serial 
Analysis of Gene Expression (SAGE), and gene expression analysis by massively parallel 

15 signature sequencing (MPSS). 

2. Reverse Transcrivtase PGR (RT-PCR) 

Of the techniques listed above, the most sensitive and most flexible quantitative 
method is RT-PCR, which can be used to compare mRNA levels in different sample 
populations, in normal and tumor tissues, with or without drug treatment, to characterize 
20 patterns of gene expression, to discriminate between closely related mRNAs, and to analyze 
RNA structure. 

The first step is the isolation of mRNA from a target sample. The starting material is 
typically total RNA isolated from human tumors or tumor cell lines, and corresponding 
normal tissues or cell lines, respectively. Thus RNA can be isolated from a variety of primary 
25 tumors, including breast, lung, colon, prostate, brain, liver, kidney, pancreas, spleen, thymus, 
testis, ovary, uterus, head and neck, etc., tumor, or tumor cell lines, with pooled DNA from 
healthy donors. If the source of mRNA is a primary tumor, mRNA can be extracted, for 
example, from frozen or archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue 
samples. 

30 General methods for mRNA extraction are well known in the art and are disclosed in 

standard textbooks of molecular biology, including Ausubel et al. 9 Current Protocols of 
Molecular Biology , John Wiley and Sons (1997). Methods for RNA extraction from paraffin 
embedded tissues are disclosed, for example, in Rupp and Locker, Lab Invest. 56:A67 (1987), 
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and De Andres et ah, BioTechniques 18:42044 (1995). In particular, RNA isolation can be 
performed using purification kit, buffer set and protease from commercial manufacturers, 
such as Qiagen, according to the manufacturer's instructions. For example, total RNA from 
cells in culture can be isolated using Qiagen RNeasy mini-columns. Other commercially 
5 available RNA isolation kits include MasterPure™ Complete DNA and RNA Purification Kit 
(EPICENTRE®, Madison, WI), and Paraffin Block RNA Isolation Kit (Ambion, Inc.). Total 
RNA from tissue samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from 
tumor can be isolated, for example, by cesium chloride density gradient centrifugation. 

As RNA cannot serve as a template for PGR, the first step in gene expression profiling 

10 by RT-PCR is the reverse transcription of the RNA template into cDNA, followed by its 
exponential amplification in a PCR reaction. The two most commonly used reverse 
transcriptases are avilo myeloblastosis virus reverse transcriptase (AMV-RT) and Moloney 
murine leukemia virus reverse transcriptase (MMLV-RT). The reverse transcription step is 
typically primed using specific primers, random hexamers, or oligo-dT primers, depending on 

15 the circumstances and the goal of expression profiling. For example, extracted RNA can be 
reverse-transcribed using a GeneAmp RNA PCR kit (Perkin Elmer, CA, USA), following the 
manufacturer's instructions. The derived cDNA can then be used as a template in the 
subsequent PCR reaction. 

Although the PCR step can use a variety of thermostable DNA-dependent DNA 

20 polymerases, it typically employs the Taq DNA polymerase, which has a 5 5 -3 9 nuclease 
activity but lacks a 3' -5' proofreading endonuclease activity. Thus, TaqMan® PCR typically 
utilizes the 5 '-nuclease activity of Taq or Tth polymerase to hydrolyze a hybridization probe 
bound to its target amplicon, but any enzyme with equivalent 5' nuclease activity can be used. 
Two oligonucleotide primers are used to generate an amplicon typical of a PCR reaction. A 

25 third oligonucleotide, or probe, is designed to detect nucleotide sequence located between the 
two PCR primers. The probe is non-extendible by Taq DNA polymerase enzyme, and is 
labeled with a reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced 
emission from the reporter dye is quenched by the quenching dye when the two dyes are 
located close together as they are on the probe. During the amplification reaction, the Taq 

30 DNA polymerase enzyme cleaves the probe in a template-dependent manner. The resultant 
probe fragments disassociate in solution, and signal from the released reporter dye is free 
from the quenching effect of the second fluorophore. One molecule of reporter dye is 
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liberated for each new molecule synthesized, and detection of the unquenched reporter dye 
provides the basis for quantitative interpretation of the data. 

TaqMan® RT-PCR can be performed using commercially available equipment, such 
as, for example, AM PRISM 7700™ Sequence Detection System™ (Perkin-Elmer-Applied 
5 Biosystems, Foster City, CA, USA), or Lightcycler (Roche Molecular Biochemicals, 
Mannheim, Germany). In a preferred embodiment, the 5' nuclease procedure is run on a real- 
time quantitative PCR device such as the ABI PRISM 7700™ Sequence Detection System™. 
The system consists of a thermocycler, laser, charge-coupled device (CCD), camera and 
computer. The system amplifies samples in a 96-well format on a thermocycler. During 

10 amplification, laser-induced fluorescent signal is collected in real-time through fiber optics 
cables for all 96 wells, and detected at the CCD. The system includes software for running 
the instrument and for analyzing the data. 

5 '-Nuclease assay data are initially expressed as Ct, or the threshold cycle. As 
discussed above, fluorescence values are recorded during every cycle and represent the 

15 amount of product amplified to that point in the amplification reaction. The point when the 
fluorescent signal is first recorded as statistically significant is the threshold cycle (Q). 

To minimize errors and the effect of sample-to-sample variation, RT-PCR is usually 
performed using an internal standard. The ideal internal standard is expressed at a relatively 
constant level among different tissues, and is unaffected by the experimental treatment. 

20 RNAs frequently used to normalize patterns of gene expression axe mRNAs for the 
housekeeping genes glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) and p-actin. 

A more recent variation of the RT-PCR technique is the real time quantitative PCR, 
which measures PCR product accumulation through a dual-labeled fluorigenic probe (i.e., 
TaqMan® probe). Real time PCR is compatible both with quantitative competitive PCR, 

25 where internal competitor for each target sequence is used for normalization, and with 
quantitative comparative PCR using a normalization gene contained within the sample, or a 
housekeeping gene for RT-PCR. For further details see, e.g. Held et al., Genome Research 
6:986-994 (1996). 

The steps of a representative protocol for profiling gene expression using fixed, 
30 paraffin-embedded tissues as the RNA source, including mRNA isolation, purification, 
primer extension and amplification are given in various published journal articles {for 
example: T.E. Godfrey et al. J. Molec. Diagnostics 2: 84-91 [2000]; K. Specht et aL, Am. J. 
Pathol. 158: 419-29 [2001]}. Briefly, a representative process starts with cutting about 10 pm 
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thick sections of paraffin-embedded tumor tissue samples. The RNA is then extracted, and 
protein and DNA are removed. After analysis of the RNA concentration, RNA repair and/or 
amplification steps may be included, if necessary, and RNA is reverse transcribed using gene 
specific promoters followed by RT-PCR. 
5 3. Microarravs 

Differential gene expression can also be identified, or confirmed using the microarray 
technique. Thus, the expression profile of breast cancer-associated genes can be measured in 
either fresh or paraffin-embedded tumor tissue, using microarray technology. In this method, 
polynucleotide sequences of interest (including cDNAs and oligonucleotides) are plated, or 

10 arrayed, on a microchip substrate. The arrayed sequences are then hybridized with specific 
DNA probes from cells or tissues of interest. Just as in the RT-PCR method, the source of 
mRNA typically is total RNA isolated from human tumors or tumor cell lines, and 
corresponding normal tissues or cell lines. Thus RNA can be isolated from a variety of 
primary tumors or tumor cell lines. If the source of mRNA is a primary tumor, mRNA can be 

15 extracted, for example, from frozen or archived paraffin-embedded and fixed (e.g. formalin- 
fixed) tissue samples, which are routinely prepared and preserved in everyday clinical 
practice. 

In a specific embodiment of the microarray technique, PCR amplified inserts of 
cDNA clones are applied to a substrate in a dense array. Preferably at least 10,000 nucleotide 

20 sequences are applied to the substrate. The microarrayed genes, immobilized on the 
microchip at 10,000 elements each, are suitable for hybridization under stringent conditions. 
Fluorescently labeled cDNA probes may be generated through incorporation of fluorescent 
nucleotides by reverse transcription of RNA extracted from tissues of interest. Labeled 
cDNA probes applied to the chip hybridize with specificity to each spot of DNA on the array. 

25 After stringent washing to remove non-specifically bound probes, the chip is scanned by 
confocal laser microscopy or by another detection method, such as a CCD camera. 
Quantitation of hybridization of each arrayed element allows for assessment of corresponding 
mRNA abundance. With dual color fluorescence, separately labeled cDNA probes generated 
from two sources of RNA are hybridized pairwise to the array. The relative abundance of the 

30 transcripts from the two sources corresponding to each specified gene is thus determined 
simultaneously. The miniaturized scale of the hybridization affords a convenient and rapid 
evaluation of the expression pattern for large numbers of genes. Such methods have been 
shown to have the sensitivity required to detect rare transcripts, which are expressed at a few 
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copies per cell, and to reproducibly detect at least approximately two-fold differences in the 
expression levels (Schena et al, Proc. Natl Acad. Set USA 93(2): 106-149 (1996)). 
Microarray analysis can be performed by commercially available equipment, following 
manufacturer's protocols, such as by using the Affymetrix GenChip technology, or Agilent 5 's 
5 microarray technology. 

The development of microarray methods for large-scale analysis of gene expression 
makes it possible to search systematically for molecular markers of cancer classification and 
outcome prediction in a variety of tumor types. 

4. Serial Analysis of Gene Expression (SAGE) 

10 Serial analysis of gene expression (SAGE) is a method that allows the simultaneous 

and quantitative analysis of a large number of gene transcripts, without the need of providing 
an individual hybridization probe for each transcript. First, a short sequence tag (about 10-14 
bp) is generated that contains sufficient information to uniquely identify a transcript, provided 
that the tag is obtained from a unique position within each transcript. Then, many transcripts 

15 are linked together to form long serial molecules, that can be sequenced, revealing the identity 
of the multiple tags simultaneously. The expression pattern of any population of transcripts 
can be quantitatively evaluated by determining the abundance of individual tags, and 
identifying the gene corresponding to each tag. For more details see, e.g. Velculescu et al, 
Science 270:484-487 (1995); and Velculescu et al, Cell 88:243-51 (1997). 

20 5. Gene Expression Analysis by Massively Parallel Signature Sequencing (MPSS) 

This method, described by Brenner et al, Nature Biotechnology 18:630-634 (2000), is 
a sequencing approach that combines non-gel-based signature sequencing with in vitro 
cloning of millions of templates on separate 5 jam diameter microbeads. First, a microbead 
library of DNA templates is constructed by in vitro cloning. This is followed by the assembly 

25 of a planar array of the template-containing microbeads in a flow cell at a high density 
(typically greater than 3 x 10 6 microbeads/cm 2 ). The free ends of the cloned templates on 
each microbead are analyzed simultaneously, using a fluorescence-based signature 
sequencing method that does not require DNA fragment separation. This method has been 
shown to simultaneously and accurately provide, in a single operation, hundreds of thousands 

30 of gene signature sequences from a yeast cDNA library. 
6. Immunoh istochemistrv 

Immunohistochemistry methods are also suitable for detecting the expression levels of 
the prognostic markers of the present invention. Thus, antibodies or antisera, preferably 
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polyclonal antisera, and most preferably monoclonal antibodies specific for each marker are 
used to detect expression. The antibodies can be detected by direct labeling of the antibodies 
themselves, for example, with radioactive labels, fluorescent labels, hapten labels such as, 
biotin, or an enzyme such as horse radish peroxidase or alkaline phosphatase. Alternatively, 
5 unlabeled primary antibody is used in conjunction with a labeled secondary antibody, 
comprising antisera, polyclonal antisera or a monoclonal antibody specific for the primary 
antibody. Immunohistochemistry protocols and kits are well known in the art and are 
commercially available. 

7. Proteomics 

10 The term "proteome" is defined as the totality of the proteins present in a sample (e.g. 

tissue, organism, or cell culture) at a certain point of time. Proteomics includes, among other 
things, study of the global changes of protein expression in a sample (also referred to as 
"expression proteomics"). Proteomics typically includes the following steps: (1) separation 
of individual proteins in a sample by 2-D gel electrophoresis (2-D PAGE); (2) identification 
15 of the individual proteins recovered from the gel, e.g. my mass spectrometry or N-terminal 
sequencing, and (3) analysis of the data using bioinformatics. Proteomics methods are 
valuable supplements to other methods of gene expression profiling, and can be used, alone or 
in combination with other methods, to detect the products of the prognostic markers of the 
present invention. 
20 8. EGFR Inhibitors 

The epidermal growth factor receptor (EGFR) family (which includes EGFR, erb-B2, 
erb-B3, and erb-B4) is a family of growth factor receptors that are frequently activated in 
epithelial malignancies. Thus, the epidermal growth factor receptor (EGFR) is known to be 
active in several tumor types, including, for example, ovarian cancer, pancreatic cancer, non- 
25 small cell lung cancer {NSCLC}, breast cancer, and head and neck cancer. Several EGFR 
inhibitors, such as ZD 1839 (also known as gefitinib or Iressa); and OSI774 (Erlotinib, 
Tarceva™), are promising drug candidates for the treatment of cancer. 

Iressa, a small synthetic quinazoline, competitively inhibits the ATP binding site of 
EGFR, a growth-promoting receptor tyrosine kinase, and has been in Phase III clinical trials 
30 for the treatment of non-small-cell lung carcinoma. Another EGFR inhibitor, [agr]cyano- 
[bgr]methyl-A r -[(trifluoromethoxy)phenyl]-propenamide (LFM-A12), has been shown to 
inhibit the proliferation and invasiveness of human breast cancer cells. 
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Cetuximab is a monoclonal antibody that blocks the EGFR and EGFR-dependent cell 
growth. It is currently being tested in phase III clinical trials. 

Tarceva™ has shown promising indications of anti-cancer activity in patients with 
advanced ovarian cancer, and non-small cell lung and head and neck carcinomas. 
5 The present invention provides valuable molecular markers that predict whether a 

patient who is a candidate for treatment with an EGFR inhibitor drug is likely to respond to 
treatment with an EGFR inhibitor. 

The listed examples of EGFR inhibitors represent both small organic molecule and 
anti-EGFR antibody classes of drugs. The findings of the present invention are equally 
10 applicable to other EGFR inhibitors, including, without limitation, antisense molecules, small 
peptides, etc. 

9. General Description of the mRNA Isolation, Purification and Amplification 
The steps of a representative protocol for profiling gene expression using fixed, 

paraffin-embedded tissues as the RNA source, including mRNA isolation, purification, 

15 primer extension and amplification are given in various published journal articles {for 

> 

example: T.E. Godfrey et al. J. Molec. Diagnostics 2: 84-91 [2000]; K. Specht et al., Am. J. 
Pathol 158: 419-29 [2001]}. Briefly, a representative process starts with cutting about 10 \m\ 
thick sections of paraffin-embedded tumor tissue samples. The RNA is then extracted, and 
protein and DNA are removed. After analysis of the RNA concentration, RNA repair and/or 
20 amplification steps may be included, if necessary, and RNA is reverse transcribed using gene 
specific promoters followed by RT-PCR. Finally, the data are analyzed to identify the best 
treatment option(s) available to the patient on the basis of the characteristic gene expression 
pattern identified in the tumor sample examined. 

10. Cancer Gene Set Assayed Gene Subsequences, and Clinical Application of 
25 Gene Expression Data 

An important aspect of the present invention is to use the measured expression of 
certain genes by cancer (e.g. lung cancer) tissue to provide prognostic information. For this 
purpose it is necessary to correct for (normalize away) both differences in the amount of RNA 
assayed and variability in the quality of the RNA used. Therefore, the assay typically 
30 measures and incorporates the expression of certain normalizing genes, including well known 
housekeeping genes, such as GAPDH and Cypl. Alternatively, normalization can be based 
on the mean or median signal (Ct) of all of the assayed genes or a large subset thereof (global 
normalization approach). On a gene-by-gene basis, measured normalized amount of a patient 
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tumor mRNA is compared to the amount found in a cancer tissue reference set. The number 
(N) of cancer tissues in this reference set should be sufficiently high to ensure that different 
reference sets (as a whole) behave essentially the same way. If this condition is met, the 
identity of the individual cancer tissues present in a particular set will have no significant 
impact on the relative amounts of the genes assayed. Usually, the cancer tissue reference set 
consists of at least about 30, preferably at least about 40 different FPE cancer tissue 
specimens. Unless noted otherwise, normalized expression levels for each mRNA/tested 
tumor/patient will be expressed as a percentage of the expression level measured in the 
reference set. More specifically, the reference set of a sufficiently high number (e.g. 40) of 
tumors yields a distribution of normalized levels of each mRNA species. The level measured 
in a particular tumor sample to be analyzed falls at some percentile within this range, which 
can be determined by methods well known in the art. Below, unless noted otherwise, 
reference to expression levels of a gene assume normalized expression relative to the 
reference set although this is not always explicitly stated. 

Further details of the invention will be apparent from the following non-limiting 
Example. 

Example 

A Phase II Study of Gene Expression in non-small cell lung cancer fNSCL) 

A gene expression study was designed and conducted with the primary goal to 

molecularly characterize gene expression in paraffin-embedded, fixed tissue samples of 

NSCLC patients who did or did not respond to treatment with an EGFR inhibitor. The results 

are based on the use of one EGFR inhibitor. 

Study design 

Molecular assays were performed on paraffin-embedded, formalin-fixed tumor tissues 
obtained from 29 individual patients diagnosed with NSCLC. Patients were included in the 
study only if histopathologic assessment, performed as described in the Materials and 
Methods section, indicated adequate amounts of tumor tissue. All patients had a history of 
prior treatment for NSCLC, and the nature of pretreatment varied. 
Materials and Methods 

Each representative tumor block was characterized by standard Mstopathology for 
diagnosis, semi-quantitative assessment of amount of tumor, and tumor grade. A total of 6 
sections (10 microns in thickness each) were prepared and placed in two Costar Brand 
Microcentrifuge Tubes (Polypropylene, 1.7 mL tubes, clear; 3 sections in each tube). If the 
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tumor constituted less than 30% of the total specimen area, the sample may have been 
dissected by the pathologist, putting the tumor tissue directly into the Costar tube. 

If more than one tumor block was obtained as part of the surgical procedure, the block 
most representative of the pathology was used for analysis. 
5 Gene Expression Analysis 

mRNA was extracted and purified from fixed, paraffin-embedded tissue samples, and 
prepared for gene expression analysis as described above. 

Molecular assays of quantitative gene expression were performed by RT-PCR, using 
the ABI PRISM 7900™ Sequence Detection System™ (Perkin-Eimer-Applied Biosystems, 
10 Foster City, CA, USA). ABI PRISM 7900™ consists of a thermocycler, laser, 
charge-coupled device (CCD), camera and computer. The system amplifies samples in a 
384-well format on a thermocycler. During amplification, laser-induced fluorescent signal is 
collected in real-time through fiber optics cables for all 384 wells, and detected at the CCD. 
The system includes software for running the instrument and for analyzing the data. 
15 Analysis and Results 

Tumor tissue was analyzed for 185 cancer-related genes and 7 reference genes. The 
threshold cycle (CT) values for each patient were normalized based on the mean of all genes 
for that particular patient. Clinical outcome data were available for all patients. 

Outcomes were evaluated in two ways, each breaking patients into two groups with 
20 respect to response. 

One analysis categorized complete or partial response [RES] as one group, and stable 
disease (min of 3 months) or progressive disease as the other group [NR]. The second 
analysis grouped patients with respect to clinical benefit, where clinical benefit was defined 
as partial response, complete response, or stable disease at 3 months. 
25 Response (partial response and complete response) was determined by the Response 

Evaluation Criteria Li Solid Tumors (RECIST criteria). Stable disease was designated as the 
absence of aggressive disease for 3 or more months. 
Analysis of 17 patients by t-test 

Analysis was performed on all 17 treated patients to determine the relationship 
30 between normalized gene expression and the binary outcomes of RES (response) or NR (non- 
response). A t test was performed on the group of patients classified as RES or NR and the p- 
values for the differences between the groups for each gene were calculated. The following 
table lists the 23 genes for which the p-value for the differences between the groups was 
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<0.10. In this case response was defined as a partial or complete response, the former being 
>50% shrink of the tumor and the latter being disappearance of the tumor. As shown, 
response was identified in two patients. 
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Table 1 

No Yes 





No Resp 


Yes Resp 








Resp 


Resp 




Mean 


Mean 


t-value 


df 


p 


Valid 
N 


Valid N 


STAT5A.1 


-0.9096 


-2.1940 


3.48829 


15 


0.003302 


15 


2 


STAT5B.2 


-0.9837 


-2.281 1 


3.35057 


15 


0.004380 


15 


2 


WISP1.1 


-3.8768 


-6.1318 


2.88841 


15 


0.011256 


15 


2 


CKAP4.2 


-0.1082 


-1.0934 


2.54034 


15 


0.022627 


15 


2 


FGFR1.3 


-3.0647 


-4.9591 


2.42640 


15 


0.028323 


15 


2 


cdc25A.4 


-4.3752 


-5.2888 


2.28383 


15 


0.037373 


15 


2 


RASSF1.3 


-1.8402 


-2.8002 


2.28308 


15 


0.037427 


15 


2 


ErbB3.1 


-10.0166 


-8.7599 


-2.13036 


15 


0.050103 


15 


2 


GUS.1 


-2.2284 


-1 .2524 


-2.12833 


15 


0.050296 


15 


2 


NRG 1.3 


-7.6976 


-10.2172 


2.10836 


15 


0.052227 


15 


2 


Bcl2.2 


-2.4212 


-3.9768 


2.10197 


15 


0.052859 


15 


2 


Hepsin.1 


-7.2602 


-5.0055 


-2.09847 


15 


0.053208 


15 


2 


CTSB.1 


3.2027 


2.0683 


2.06857 


15 


0.056279 


15 


2 


TAGLN.3 


1 .7465 


0.0009 


2.05991 


15 


0.057199 


15 


2 


YB-1 .2 


1 .3480 


0.8782 


2.03095 


15 


0.060374 


15 


2 


Src.2 


-0.0393 


-0.9239 


1.93370 


15 


0.072248 


15 


2 


IGF1R.3 


-2.8269 


-3.7970 


1.93140 


15 


0.072553 


15 


2 


CD44S.1 


0.0729 


-1.3075 


1.90370 


15 


0.076315 


15 


2 


DIABLO. 1 


-3.6865 


-4.4254 


1.84770 


15 


0.084461 


15 


2 


VEGF.1 


1 .3981 


2.3817 


-1.82941 


15 


0.087285 


15 


2 


TIMP2.1 


2.5347 


1.4616 


1.82763 


15 


0.087565 


15 


2 


AREG.2 


-1 .5665 


-4.5616 


1.82558 


15 


0.087887 


15 


2 


PDGFRa.2 


-0.8243 


-2.7529 


1.79553 


15 


0.092738 


15 


2 



In the foregoing Table 1, lower mean expression C t values indicate lower expression 
and, conversely, higher mean expression values indicate higher expression of a particular 
5 gene. Thus, for example, expression of the STATS A or STAT5B gene was higher in patients 
who did not respond to EGFR inhibitor treatment than in patients that did respond to the 
treatment. Accordingly, elevated expression of STAT5A or STAT5B is an indication of 
poor outcome of treatment with an EGFR inhibitor. Phrasing it differently, if the STATS A or 
STAT5B gene is over-expressed in a tissue simple obtained from the cancer of a NSCLC 
10 patient, treatment with an EGFR inhibitor is not likely to work, therefore, the physician is 
well advised to look for alternative treatment options. 

Accordingly, the elevated expression of STATS A, STAT5B, WISP1, CKAP4, 
FGFR1, cdc25A or RASSFlin a tumor is an indication that the patient is not likely to respond 
well to treatment with an EGFR inhibitor. On the other hand, elevated expression of ErbB3 is 
15 an indication that the patient is likely to respond to EGFR inhibitor treatment. 
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In Table 2 below the binary analysis was carried with respect to clinical benefit, 
defined as either partial response, complete response, or stable disease. As shown, 5 patients 
met these criteria for clinical benefit. 

Table 2 

No Yes 





No Benefit 


Yes Benefit 








Benefit 


Benefit 




Mean 


Mean 


t-value 


df 


P 


Valid N 


Valid N 


G-Catenin.1 


0.0595 


-0.7060 


2.28674 


15 


0.037164 


12 


5 


Hepsin.1 


-7.4952 


-5.7945 


-2.28516 


15 


0.037277 


12 


5 


ErbB3.1 


-10.1269 


-9.2493 


-2.09612 


15 


0.053444 


12 


5 


MTA1.1 


-2.3587 


-1.6977 


-1 .94548 


15 


0.070705 


12 


5 


H2AFZ.2 


-1.0432 


-1.6448 


1 .82569 


15 


0.087869 


12 


5 


NME1.3 


0.4774 


-0.1769 


1 .80874 


15 


0.090578 


12 


5 


LMYC.2 


-3.6259 


-3.2175 


-1.71006 


15 


0.107853 


12 


5 


AREG.2 


-1.3375 


-3.3140 


1 .67977 


15 


0.113704 


12 


5 


Surfact A1.1 


-1.9341 


2.9822 


-1.63410 


15 


0.123046 


12 


5 


CDH1.3 


-1.3614 


-2.1543 


1 .59764 


15 


0.130971 


12 


5 


PTPD1.2 


-2.7517 


-2.0708 


-1 .52929 


15 


0.147004 


12 


5 



As shown in the above Table 2, 6 genes correlated with clinical benefit with p<0.1. 
Expression of G-catenin, H2AFZ, and NME1 was higher in patients who did not respond to 
anti-EGFR treatment. Thus, greater expression of these genes is an indication that patients 
are unlikely to benefit from anti-EGFR treatment. Conversely, expression of Hepsin, ErbB3, 
and MTA was higher in patients who did respond to anti-EGFR treatment. Greater 
expression of these genes indicates that patients are likely to benefit from anti-EGFR 
treatment. 

Table 3 shows the accession numbers and amplicon sequences used during the PGR 
amplification of the genes identified. 

Table 4 shows the accession numbers and the sequences of the primer/probe sets used 
during the PGR amplification of the genes identified. For each gene the forward primer 
sequence is identified as £2, the probe sequence as p2, and the reverse primer sequence as r2. 

It is emphasized that while the data presented herein were obtained using tissue 
samples from NSCLC, the conclusions drawn from the tissue expression profiles are equally 
applicable to other cancers, such as, for example, colon cancer, ovarian cancer, pancreatic 
cancer, breast cancer, and head and neck cancer. 

All references cited throughout the specification are hereby expressly incorporated by 
reference. 
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TABLE 4 

Accession 



Gene 


Number 


Part Name 


Sequence 


Lenath 
****** * y *•• * 


NME1 


NM. 


000269 


S2528/NME1.p3 


CCTGGGACCATCCGTGGAGACTTCT 


25 


NRG1 - 




013957 


S1240/NRG1.f3 


CGAGACTCTCCTCATAGTGAAAGGTAT 


27 


NRG1 


nm] 


013957 


S1241/NRG1.r3 


CTTGGCGTGTGGAAATCTACAG 


22 


NRG-1 - 


; nm~ 


013957 


S1242/NRG1.p3- 


ATGACCACCCCGGCTCGTATGTCA 


24 


PDGFRa 


NM_006206 


S0226/PDGFRa.f2 


GGGAG I J I CCAAGAGATGGA 


20 


PDGFRa 


NM. 


006205 . 


•S0227/PDGFRa.p2 


CC C AAG ACC C G AC C AAG C A CTAG 


23 


PDGFRa 


nm] 


[006206 


S0228/PDGFRa.r2 


CTTCAACCACCTTCCCAAAC 


20 


RASSF1 • 


nm~ 


JD07182 ' 


S2393/RASSF1.f3 


AGTGGGAGACACCTGACCTT 


20 


RASSF1 


nm" 


J)07182 


S2394/RASSF1.r3 


TGATCTGGGCATTGTACTCC 


20 


RASSF1 


nm" 


JD07182 


* S2395/RASSF1.p3 


TTG ATCTTCTG CTC AATCTC AG CTTG AG A 


29 


Src 


nm" 


]004383 


S1820/Srci2 


CCTGAACATGAAGGAGCTGA 


20 


Src 


•nm] 


JD04383 


S1821/Src.r2 


CATCACGTCTCCGAACTCC 


19 


Src 


nm] 


J)04383 


S1822/Src.p2 


TCCCGATGGTCTGCAGCAGCT 


21 


STAT5A 


nm" 


JD03152 


S1219/STAT5A.f1 


GAGGCGCTCAACATGAAATTC 


•21 


STAT5A 


nm] 


JD03152 


S1220/STAT5A.r1 


GCCAGGAACACGAGGTTCTC 


20 


STAT5A 


nm" 


J303152 


S12217STAT5A.p1 


CGGTTGCTCTGCACTTCGGCCT 


22 


STAT5B 


nm] 


]o 12448 


S2399/STAT5B/f2 


C CAGTG GTGGTG ATC GTTC A 


20 


STATS B 


nm. 


.012448 • 


S2400/STAT5B.r2 


GCAAAAGCATTGTCCCAGAGA - 


21 


STAT5B 


1 NM. 


.012448 


' S2401/STAT5B.p2 


CAGCCAGGACAACAATGCGACGG 


23 


TAGLN 


nm] 


.003186 . 


S3185/TAGLN.f3 


GATGGAGCAGGTGGCTCAGT 


20 


TAGLN ' 


' nm" 


.003186 


S3186/TAGLN.r3 


: AGTCTGGAACATGTCAGTCTTGATG 




TAGLN 


NM 


.003186 


S3187/TAGLN.p3 


C C C AG AGTCCTC AG C CG C CTTC AG 


2d 


TIMP2 


NM 


.003255 . 


; S1680/TIMP2.ff 


• TCACCCTCTGTGACTTCATCGT 


99 


TJMP2 


NM. 


.003255 


S1681/T!MP2.r1 


TGTGGTTCAGGCTCTTCTTCTG 


22 


TIMP2 


NM. 


.003255 


S1682/TIMP2.p1 


CCCTGGGACACCCTGAGCACCA 


22 


VEGF 


nm] 


.003376 


S0286/VEGF.f1 


CT G CTGTCTTG G GTG C ATTG 


20 


VEGF 


NM. 


.003376 


S0287/VEGF.p.1 


TTG C CTTG CT G CTCT AC CT C C AC C A 


25 


VEGF 


NM. 


.003376 


S0288A/EGF.r1 


GCAGCCTGGGACCACTTG 


18 


W1SP1 


NM 


.003882 


S1671/WlSP1.f1 


AGAGGCATCCATGAACTTCACA 


22 


WISP1 


NM. 


.003882 


S1672/WISP1.ri ' 


CAAACTCCACAGTACTTGGGTTGA 


24 


WISP1 


NM. 


.003882 


S1673/WlSP1.p1 


CGGGCTGCATCAGCACACGC 


20 


YB-1 


NM. 


.004559 


S1194/YB-1.f2 


AGACTGTGGAGTTTGATGTTGTTGA 


25 


YB-1 


NM 


.004559^ 


S1195/YB-1.r2 


GGAACACCACCAGGACCTGTAA 


22 


YB-1 


NM 


.004559 


S1199/YB-1.p2 


TTGCTGCCTCCGCACCC I i 1 1 CT 
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Accession 


Gene 


Number 


AREG 


NMJD01657 


AREG 


NMJ)01657 


AREG 


NM_001657 


Bc!2 


NM_000633 


Bcl2 


NMJD00633 


Bc!2 


NMJ3O0633 


CD44S 


' M59040 


CD44s 


M59040 • 


CD44s ' 


M59040 


cdc25A 


NM_0OT789 


cdc25A 


NMJ301/89 


cdc25A 


NMJ)01789 


CKAP4 


.NMJD06825 


CKAP4 


NMJX36825 


CKAP4 


NMJD06825 


CTSB 


NMJD01908 


CTSB 


NMJ301908 


CTSB 


NMJD01908 


DIABLO 


NMJ3 19887 


DIABLO 


NMJ3 19887 


DIABLO 


NMJD1 9887 


ErbB3 


NMJ301982 


ErbB3 


NMJD01982 


ErbB3 


NMJ301982 


FGFR1 


NM_023109 


FGFR1 


NM_023109 


FGFR1 ' 


NMJ323109 


G-Catenin 


NMJ3O2230 


G-Catenin 


NMJ)02230 


G-Catenin 


NMJ302230 


GUS 


NMJ300181 


GUS 


NM_000181 


GUS 


NMJD00181 


H2AFZ 


N.MJD02106 


H2AFZ 


NMJ3O2106 


H2AFZ 


. NMJ)02106 


Hepsin 


NMJ302151 


Hepsin • ' 


NMJ302151 


Hepsin 


NMJ302151 


IGF1R 


NMJD00875 


IGF1R 


NMJD0Q.875 


IGF1R 


■ NMJ3O0875 


MTA1 


NMJ304689 


MTA1 


NMJD04689 


MTA1 


NM„004689 


NME1 


NM_000269 


NME1 


NMJD00269 



Part Name 

S0025/AREG.f2 
S0026/AREG.p2 
S0027/AREG.r2 
S0043/Bcl2.f2 
S0044/Bcl2.p2 
S0045/Bci2.r2 , 
S3102/CD44s.f1 :. 
S3103/CD44s.r1 
S3104/CD44s.p1 ,' 
• S0070/cdc25A.f4 
S0071/cdc25A.p4 
S0072/cdc25A.r4 
S2381/CKAP4J2 
S2382/CKAP4.r2. 
S2383/CKAP4.p2 
S1146/CTSB.f1 

. S1147/CTSB:r1 
S1180/CTSB.p1 
S0808/DIABLO.f1 
S0809/DIABLO.r1 
S1105/OIABLO.p1 
S0112/ErbB3.ft 
S0113/ErbB3,p1 
•S0114/ErbB3,r1 
■S0818/FGFR1.f3 
S0819/FGFR1.r3 
S1110/FGFR1.p3 
S2153/G-Cate.ft 
S2154/G-Cate.r1 
S2155/G-Cate.p1 
S0139/GUS.f1 
S0140/GUS.p1 

, S0141/GUS.M 
S3012/H2AFZ.f2 
S3013/H2AFZ.r2 
S3Q14/H2AFZ.p2 
S2269/Hepsin.f1 
S2270/Hepsin.r1 
S2271/Hepsin.p1 
■S1249/IGF1R.f3 
S1250/IGF1R.r3 
S1251/IGF1R.p3 
S2369/MTA1 .f1 
S2370/MTA1 .r1 
'S2371/MTAl.p1 
S2526/NME1.f3 
S2527/NME1.r3 



TABLE 4 

Sequence Length 

TGTGAGTGAAATGCCTTCTAGTAGTGA 27 

CCGTCCTCGGGAGCCGACTATGA 23 

TTGTGGTTCGTTATCATACTCTTCTGA 27 

CAGATGGACCTAGTACCCACTGAGA 25 

TTCCACGCCGAAGGACAGCGAT 22 

CCTATGATTTAAGGGCATTTTTCC 24 

GACGAAG ACAGTCCCTGGAT 20 

ACTGGGGTGGAATGTGTCTT 20 

CACCGACAGCACAGACAGAATCCC 24 

TCTTGCTGGCTACGCCTCTT 20 

TGTCCCTGTTAGACGTCCTCCGTCCATA 28 

CTGCATTGTGGCACAGTTCTG 21 

AAAGCCTCAGTCAGCCAAGT 20 

AACCAAACTGTCCACAGCAG 20 

TCCTGAGCATTTTCAAGTCCGCCT 24 

GGCCGAGATCTACAAAAACG 20 

GCAGGAAGTCCGAATACACA 20 

. CCCCGTGGAGGGAGCTTTCTC 21 

CACAATGGCGGCTCTGAAG ' 19 

ACACAAAGACTGTCTGTACCTGAAGA 26 
AAGTTACGCTGCGCGACAGCCAA . 23 

CGGTTATGTCATGCCAGATACAC 23 
CCTCAAAGGTACTCCCTCCTCCCGG ' 25 

GAACTGAGACCCACTGAAGAAAGG 24" 

CACGGGACATTCACCACATC 20 
G G G TG C C ATC C AC TTC AC A ' ' 19 

ATAAAAAGACAACCAACGGCCGACTGC 27 

TCAGCAGCAAGGGCATCAT 19 

GGTGGTTTTCTTGAGCGTGTACT 23 

CGCCCGCAGGCCTCATCCT 1 9 

C CC AC TC AG TAG C C AAG TC A 20 

TCAAGTAAACGGGCTGTTTTCCAAACA 27 

CACGCAGGTGGTATCAGTCT 20 

CCGGAAAGGCCAAGACAA 18 

AATACGGCCCACTGGGAACT 20 

CCCGCTCGCAGAGAGCCGG 19 

AGGCTGCTGGAGGTCATCTC . 20 

CTTCCTGCGGCCACAGTCT 19 

CCAGAGGCCGTTTCTTGGCCG 21 

GCATGGTAGCCGAAGATTTCA 21 

TTTCCGGTAATAGTCTGTCTCATAGATATC 30 

CGCGTCATACCAAAATCTCCGATTTTGA 28 

CCGCCCTCACCTGAAGAGA 19 

GGAATAAGTTAGCCGCGCTTCT ' 22 

CCCAGTGTCCGCCAAGGAGCG 21 

CCAACCCTGCAGACTCCAA 19 

ATGTATAATGTTCCTGCCAACTTGTATG 28 
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WHAT IS CLAIMED IS: 

1 . A method for predicting the likelihood that a patient who is a candidate for 
treatment with an EGFR inhibitor will respond to said treatment, comprising determining the 
expression level of one or more prognostic RNA transcripts or their expression products in a 

5 cancer tissue sample obtained from said patient, wherein the prognostic transcript is the 
transcript of one or more genes selected from the group consisting of: STAT5A, STAT5B, 
WISP1, CKAP4, FGFR1, cdc25A, RASSF1, G-Catenin, H2AFZ, NME1, NRG1, BC12, 
TAGLN, YB-1, Src, IGF1R, CD44, DIABLO, TIMP2, AREG, PDGFRa, CTSB, Hepsin, 
ErbB3, MTA1, Gus, and VEGF., wherein (a) over-expression of the transcript of one or more 

10 of STAT5A, STAT5B, WISP1, CKAP4, FGFR1, cdc25A, RASSF1, G-Catenin, H2AFZ, 
NME1, NRG1, BC12, TAGLN, YB1, Src, IGF1R, CD44, DIABLO, TIMP2, AREG, 
PDGFRa, and CTSB, or the corresponding expression product, indicates that the patient is not 
likely to respond well to said treatment, and (b) over-expression of the transcript of one or 
more of Hepsin, ErbB3, MTA, Gus, and VEGF, or the corresponding expression product, 

15 indicates that the patient is likely to respond well to said treatment. 

2. The method of claim 1 comprising determining the expression level of at least 
two of said prognostic transcripts or their expression products. 

3. The method of claim 1 comprising determining the expression level of at least 
5 of said prognostic transcripts or their expression products. 

20 4. The method of claim 1 comprising determining the expression level of all of 

said prognostic transcripts or their expression products. 

5. The method of claim 1 wherein over-expression is determined with reference 
to the mean expression level of all measured gene transcripts, or their expression products, in 
said sample. 

25 6. The method of claim 1 wherein said cancer is selected from the group 

consisting of ovarian cancer, colon cancer, pancreatic cancer, non-small cell lung cancer, 
breast cancer, and head and neck cancer. 

7. The method of claim 1 where the tissue is fixed, paraffin-embedded, or fresh, 
or frozen. 

30 8. The method of claim 1 where the tissue is from fine needle, core, or other 

types of biopsy. 

9. The method of claim 1 wherein the tissue sample is obtained by fine needle 
aspiration, bronchial lavage, or transbronchial biopsy. 
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10. The method of claim 1 wherein the expression level of said prognostic RNA 
transcript or transcripts is determined by RT-PCR. 

1 1 . The method of claim 1 wherein the expression level of said expression product 
or products is determined by immunohistochemistry. 

5 12. The method of claim 1 wherein the expression level of said expression product 

or products is determined by proteomics technology. 

13. The method of claim 1 wherein the assay for measurement of the prognostic 
RNA transcripts or their expression products is provided in the form of a kit or kits. 

14. The method of claim 1 wherein the EGFR inhibitor is an antibody or an 
1 0 antibody fragment. 

15. The method of claim 1 wherein the EGFR inhibitor is a small molecule. 

16. An array comprising polynucleotides hybridizing to the following genes: 
STAT5A, STAT5B, WISP1, CKAP4, FGFrl, cdc25A, RASSF1, G-Catenin, H2AFZ, NME1, 
NRG1, BC12, TAGLN, YB1, Src, IGF1R, CD44, DIABLO, TIMP2, AREG, PDGFrA, 

15 CTSB, Hepsin, ErbB3, MTA, Gus, and VEGF, immobilized on a solid surface. 

17. An array comprising polynucleotides hybridizing to the following genes: 
STATS A, STAT5B, WISP1, CKAP4, FGFR1, cdc25A ? RASSF1, G-Catenin, H2AFZ, 
NME1, NRG1, BC12 5 TAGLN, YB1, Src, IGF1R ? CD44 ? DIABLO, TIMP2, AREG, 
PDGFRa, and CTSB. 

20 18. An array comprising polynucleotides hybridizing to the following genes: 

Hepsin, ErbB3, MTA, Gus, and VEGF. 

19. The array of any one of claims 16-18 wherein said polynucleotides are 
cDNAs. 

20. The array of claim 19 wherein said cDNAs are about 500 to 5000 bases long. 
25 21. The array of any one of claims 16-18 wherein said polynucleotides are 

oligonucleotides. 

22. The array of claim 21 wherein said oligonucleotides are about 20 to 80 bases 

long. 

23. The array of claim 22 which comprises about 330,000 oligonucleotides. 
30 24. The array of any one or claims 16-18 wherein said solid surface is glass. 

25. A method of preparing a personalized genomics profile for a patient, 
comprising the steps of: 
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(a) subjecting RNA extracted from cancer tissue obtained from the patient to gene 
expression analysis; 

(b) determining the expression level in the tissue of one or more genes selected 
from the group consisting of STATS A, STAT5B, WISP1, CKAP4, FGFrl, cdc25A, RASSF1, 

5 G-Catenin, H2AFZ, NME1, NRG1, BC12, TAGLN, YB1, Src, IGF1R, CD44, DIABLO, 
TIMP2, AREG, PDGFRA, CTSB, Hepsin, ErbB3, MTA, Gus, and VEGF, wherein the 
expression level is normalized against a control gene or genes and optionally is compared to 
the amount found in a corresponding cancer reference tissue set; and 

(c) creating a report summarizing the data obtained by said gene expression 
10 analysis. 

26. The method of claim 25 wherein said tissue is obtained from a fixed, paraffin- 
embedded biopsy sample. 

27. The method of claim 26 wherein said RNA is fragmented. 

28. The method of claim 25 wherein said report includes prediction of the 
1 5 likelihood that the patient will respond to treatment with an EGFR inhibitor. 

29. The method of claim 25 wherein the cancer is lung cancer. 

30. The method of claim 25 wherein the cancer is selected from the group 
consisting of colon cancer, head and neck cancer, lung cancer and breast cancer. 

31. The method of claim 25 wherein said report includes recommendation for a 
20 treatment modality of said patient. 

32. A method for amplification of a gene selected from the group consisting of 
STAT5A, STAT5B, WISP1, CKAP4, FGFrl, cdc25A, RASSF1, G-Catenin, H2AFZ, NME1, 
NRG1, BC12, TAGLN, YB1, Src, IGF1R, CD44, DIABLO, TIMP2, AREG, PDGFRA, 
CTSB, Hepsin, ErbB3, MTA, Gus, and VEGF by polymerase chain reaction (PGR), 

25 comprising performing said PGR by using a corresponding amplicon listed in Table 3, and a 
corresponding primer-probe set listed in Table 4. 

33. A PGR primer-probe set listed in Table 4. 

34. A PGR amplicon listed in Table 3 . 

35. A prognostic method comprising: 

30 (a) subjecting a sample comprising cancer cells obtained from a patient to 

quantitative analysis of the expression level of the RNA transcript of at least one gene 
selected from the group consisting of STATS A, STAT5B, WISP1, CKAP4, FGFR1, cdc25A, 



28 



WO 2004/071572 



PCT/US2004/003596 



RASSF1, G-Catenin, H2AFZ, NME1, NRG1, BC12, TAGLN, YB1, Src, IGF1R, CD44, 
DIABLO, TIMP2, AREG, PDGFRa, and CTSB, or their product, and 

(b) identifying the patient as likely to have a decreased likelihood of 
responding well to treatment with an EGFR inhibitor if the normalized expression levels of 
5 said gene or genes, or their products, are elevated above a defined expression threshold. 

36. The method of claim 35 wherein said cancer cells are selected from the group 
consisting of non-small cell lung cancer (NSCLC) cells, colon cancer, head and beck cancer, 
lung cancer and breast cancer cells. 

37. A prognostic method comprising: 

10 (a) subjecting a sample comprising cancer cells obtained from a patient to 

quantitative analysis of the expression level of the RNA transcript of at least one gene 
selected from the group consisting of Hepsin, ErbB3, MTA, Gus, and VEGF, or their product, 
and 

(b) identifying the patient as likely to have an increased likelihood of 
15 responding well to treatment with an EGFR inhibitor if the normalized expression levels of 
said gene or genes, or their products, are elevated above a defined expression threshold. 

38. The method of claim 37 wherein said cancer cells are selected from the group 
consisting of non-small cell lung cancer (NSCLC) cells, colon cancer, head and beck cancer, 
lung cancer and breast cancer cells. 

20 39. The method of claim 35 or 37 wherein the levels of the RNA transcripts of 

said genes are normalized relative to the mean level of the RNA transcript or the product of 
two or more housekeeping genes. 

40. The method of claim 39 wherein the housekeeping genes are selected from the 
group consisting of glyceraldehyde-3-phosphate dehydrogenase (GAPDH), Cypl, albumin, 

25 actins, tubulins, cyclophilin hypoxantine phosphoribosyltransferase (HRPT), L32, 28S, and 
18S. 

41. The method of claim 35 or 37 wherein the sample is subjected to global gene 
expression analysis of all genes present above the limit of detection. 

42. The method of claim 41 wherein the levels of the RNA transcripts of said 
30 genes are normalized relative to the mean signal of the RNA transcripts or the products of all 

assayed genes or a subset thereof. 

43. The method of claim 42 wherein the level of RNA transcripts is determined by 
quantitative RT-PCR (qRT-PCR), and the signal is a Ct value. 
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44. The method of claim 43 wherein the assayed genes include at least 50 cancer 
related genes. 

45. The method of claim 43 wherein the assayed genes includes at least 100 cancer 
related genes. 

5 46. The method of claim 35 or 37 wherein said patient is human. 

47. The method of claim 46 wherein said sample is a fixed, paraffin-embedded 
tissue (FPET) sample, or fresh or frozen tissue sample. 

48. The method of claim 46 wherein said sample is a tissue sample from fine 
needle, core, or other types of biopsy. 

10 49. The method of claim 46 wherein said quantitative analysis is performed by 

qRT-PCR. 

50. The method of claim 46 wherein said quantitative analysis is performed by 
quantifying the products of said genes. 

51. The method of claim 50 wherein said products are quantified by 
1 5 immunohistochemistry or by proteomics technology. 

52. The method of claim 35 further comprising the step of preparing a report 
indicating that the patient has a decreased likelihood of responding to treatment with an 
EGFR inhibitor. 

53. The method of claim 37 further comprising the step of preparing a report 
20 indicating that the patient has an increased likelihood of responding to treatment with an 

EGFR inhibitor. 

54. A kit comprising one or more of (1) extraction buffer/reagents and protocol; 
(2) reverse transcription buffer/reagents and protocol; and (3) qPCR buffer/reagents and 
protocol suitable for performing the method of any one of claims 1, 35 and 37. 

25 
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39740-0009 PCT.txt 
SEQUENCE LISTING 

<110> GENOMIC HEALTH, INC. 

CEDARS-SINAI MEDICAL CENTER 
AGUS, David 
shak, Steven 
cronin, Maureen T. 
BAKER, J off re B. 

<120> Gene Expression Markers for Response to 
EGFR Inhibitor Drugs 

<130> 39740-0009 PCT 

<140> Not Assigned 
<141> 2004-02-05 

<150> US 60/445,968 
<151> 2003-02-06 

<160> 108 

<170> FastSEQ for windows version 4.0 

<210> 1 
<211> 82 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Amp! icon 
<400> 1 

tgtgagtgaa atgccttcta gtagtgaacc gtcctcggga gccgactatg actactcaga 60 
agagtatgat aacgaaccac aa 82 

<210> 2 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 2 

cagatggacc tagtacccac tgagatttcc acgccgaagg acagcgatgg gaaaaatgcc 60 
cttaaatcat agg ~ 73 

<210> 3 
<211> 78 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Amplicon 
<400> 3 

gacgaagaca gtccctggat caccgacagc acagacagaa tccctgctac cagagaccaa 60 
gacacattcc accccagt ~ 78 

<210> 4 
<211> 71 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Amp neon 
<400> 4 

tcttgctggc tacgcctctt ctgtccctgt tagacgtcct ccgtccatat cagaactgtq 60 
ccacaatgca g w 71 

<210> 5 
<211> 66 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Amp! icon 
<400> 5 

aaagcctcag teagecaagt ggaggeggae ttgaaaatgc teaggactge tgtggacagt 60 
ttggtt ~ 66 

<210> 6 
<211> 62 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amp! icon 
<400> 6 

ggecgagate tacaaaaacg gccccgtgga gggagctttc tctgtgtatt cggacttcct 60 
gc 62 

<210> 7 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 7 

cacaatggcg gctctgaaga gttggctgtc gcgcagcgta acttcattct tcaggtacag 60 
acagtgtttg tgt ~ " 73 

<210> 8 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 8 

cggttatgtc atgecagata cacacctcaa aggtactccc tcctcccggg aaggcaccct 60 
ttcttcagtg ggtctcagtt c " 81 

<210> 9 
<211> 74 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 9 

cacgggacat tcaccacatc gactactata aaaagacaac caacggccga ctgcctgtga 60 
agtggatggc accc "* ^ 74 

<210> 10 
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<211> 68 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Amp! icon 
<400> 10 

tcagcagcaa gggcatcatg gaggaggatg aggcctgcgg gcgccagtac acgctcaaga 60 
aaaccacc ~ ~ 58 

<210> 11 
<211> 73 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Amplicon 
<400> 11 

cccactcagt agccaagtca caatgtttgg aaaacagccc gtttacttga gcaagactga 60 
taccacctgc gtg 73 

<210> 12 
<211> 71 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Amplicon 
<400> 12 

ccggaaaggc caagacaaag gcggtttccc gctcgcagag agccggcttg cagttcccag 60 
tgggccgtat t ^ 71 

<210> 13 
<211> 84 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 13 

aggctgctgg aggtcatctc cgtgtgtgat tgccccagag gccgtttctt ggccgccatc 60 
tgccaagact gtggccgcag gaag " 84 

<210> 14 
<211> 83 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Amplicon 
<400> 14 

gcatggtagc cgaagatttc acagtcaaaa tcggagattt tggtatgacg cgagatatct 60 
atgagacaga ctattaccgg aaa ~ ~ 83 

<210> 15 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
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<400> 15 

ccgccctcac ctgaagagaa acgcgctcct tggcggacac tgggggagga gaggaagaag 60 
cgcggctaac ttattcc 77 

<210> 16 
<211> 74 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amp! icon 
<400> 16 

ccaaccctgc agactccaag cctgggacca tccgtggaga cttctgcata caagttggca 60 
ggaacattat acat * ~ ~ ~ 74 

<210> 17 
<211> 83 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Amp! icon 
<400> 17 

cgagactctc ctcatagtga aaggtatgtg tcagccatga ccaccccggc tcgtatgtca 60 
cctgtagatt tccacacgcc aag 83 

<210> 18 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 18 

gggagtttcc aagagatgga ctagtgcttg gtcgggtctt ggggtctgga gcgtttggga 60 
aggtggttga ag 72 

<210> 19 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 19 

agtgggagac acctgacctt tctcaagctg agattgagca gaagatcaag gagtacaatg 60 
cccagatca ^ 69 

<210> 20 
<211> 64 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 20 

cctgaacatg aaggagctga agctgctgca gaccatcggg aagggggagt tcggagacgt 60 
gatg 64 

<210> 21 
<211> 77 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Amp"! icon 
<400> 21 

gaggcgctca acatgaaatt caaggccgaa gtgcagagca accggggcct gaccaaggag 60 
aacctcgtgt tcctggc 77 

<210> 22 
<211> 74 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amp! icon 
<400> 22 

ccagtggtgg tgatcgttca tggcagccag gacaacaatg cgacggccac tgttctctgg 60 
gacaatgctt ttgc 74 

<210> 23 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amp! icon 
<400> 23 

gatggagcag gtggctcagt tcctgaaggc ggctgaggac tctggggtca tcaagactga 60 
catgttccag act 73 

<210> 24 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 24 

tcaccctctg tgacttcatc gtgccctggg acaccctgag caccacccag aagaagagcc 60 
tgaaccaca 69 

<210> 25 
<211> 71 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 25 

ctgctgtctt gggtgcattg gagccttgcc ttgctgctct acctccacca tgccaagtgg 60 
tcccaggctg c 71 

<210> 26 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 26 

agaggcatcc atgaacttca cacttgcggg ctgcatcagc acacgctcct atcaacccaa 60 
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gtactgtgga gtttg 75 

<210> 27 
<211> 76 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Amplicon 
<400> 27 

agactgtgga gtttgatgtt gttgaaggag aaaagggtgc ggaggcagca aatgttacag 60 
gtcctggtgg tgttcc " ^ ~ 76 

<210> 28 
<211> 27 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> forward primer 
<400> 28 

tgtgagtgaa atgccttcta gtagtga 27 

<210> 29 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 

<400> 29 

ccgtcctcgg gagccgacta tga 23 

<210> 30 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 30 

ttgtggttcg ttatcatact cttctga 27 

<210> 31 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 31 

cagatggacc tagtacccac tgaga 25 

<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 



<400> 32 
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ttccacgccg aaggacagcg at 

<210> 33 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> reverse primer 
<400> 33 

cctatgattt aagggcattt ttcc 

<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 34 

gacgaagaca gtccctggat 

<210> 35 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 35 

actggggtgg aatgtgtctt 

<210> 36 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 36 

caccgacagc acagacagaa tccc 

<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 37 

tcttgctggc tacgcctctt 

<210> 38 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 38 

tgtccctgtt agacgtcctc cgtccata 
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<210> 39 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 39 

ctgcattgtg gcacagttct g 21 

<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 



<210> 41 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> reverse primer 
<400> 41 

aaccaaactg tccacagcag 20 

<210> 42 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 



<210> 43 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 43 

ggccgagatc tacaaaaacg 20 

<210> 44 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 



<400> 40 

aaagcctcag tcagccaagt 



20 



<400> 42 

tcctgagcat tttcaagtcc gcct 



24 



<400> 44 

gcaggaagtc cgaatacaca 



20 
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<210> 45 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 45 

ccccgtggag ggagctttct c 

<210> 46 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 46 

cacaatggcg gctctgaag 

<210> 47 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> reverse primer 
<400> 47 

acacaaacac tgtctgtacc tgaaga 

<210> 48 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 48 

aagttacgct gcgcgacagc caa 

<210> 49 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 49 

cggttatgtc atgccagata cac 

<210> 50 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 50 

cctcaaaggt actccctcct cccgg 
<210> 51 
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<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 51 

gaactgagac ccactgaaga aagg 24 

<210> 52 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 52 

cacgggacat tcaccacatc 20 

<210> 53 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 53 

gggtgccatc cacttcaca 19 

<210> 54 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 54 

ataaaaagac aaccaacggc cgactgc 27 

<210> 55 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 

<400> 55 

tcagcagcaa gggcatcat 19 

<210> 56 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> reverse primer 
<400> 56 

ggtggttttc ttgagcgtgt act 23 

<210> 57 
<211> 19 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 57 

cgcccgcagg cctcatcct 19 

<210> 58 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 58 

cccactcagt agccaagtca 20 

<210> 59 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> probe 
<400> 59 

tcaagtaaac gggctgtttt ccaaaca 27 
<210> 60 

<2U> 20 f 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 60 

cacgcaggtg gtatcagtct 20 

<210> 61 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 



18 



<400> 61 

ccggaaaggc caagacaa 

<210> 62 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 62 

aatacggccc actgggaact 20 



<210> 63 
<211> 19 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> probe 
<400> 63 

cccgctcgca gagagccgg 19 

<210> 64 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 64 

aggctgctgg aggtcatctc 20 

<210> 65 
<211> 19 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> reverse primer 
<400> 65 

cttcctgcgg ccacagtct 19 

<210> 66 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> probe 
<400> 66 

ccagaggccg tttcttggcc g 21 

<210> 67 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 67 

gcatggtagc cgaagatttc a 21 

<210> 68 
<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> reverse primer 
<400> 68 

tttccggtaa tagtctgtct catagatatc 30 

<210> 69 
<211> 28 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> probe 
<400> 69 

cgcgtcatac caaaatctcc gattttga 

<210> 70 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 70 

ccgccctcac ctgaagaga 

<210> 71 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 71 

ggaataagtt agccgcgctt ct 

<210> 72 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> probe 
<400> 72 

cccagtgtcc gccaaggagc g 

<210> 73 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 73 

ccaaccctgc agactccaa 

<210> 74 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 74 

atgtataatg ttcctgccaa cttgtatg 

<210> 75 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> probe 
<400> 75 

cctgggacca tccgtggaga cttct 

<210> 76 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 76 

cgagactctc ctcatagtga aaggtat 

<210> 77 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 77 

cttggcgtgt ggaaatctac ag 

<210> 78 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 78 

atgaccaccc cggctcgtat gtca 

<210> 79 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 79 

gggagtttcc aagagatgga 

<210> 80 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 80 

cccaagaccc gaccaagcac tag 

<210> 81 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> reverse primer 
<400> 81 

cttcaaccac cttcccaaac 

<210> 82 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 82 

agtgggagac acctgacctt 

<210> 83 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> reverse primer 
<400> 83 

tgatctgggc attgtactcc 

<210> 84 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 84 

ttgatcttct gctcaatctc agcttgaga 

<210> 85 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 85 

cctgaacatg aaggagctga 

<210> 86 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> reverse primer 
<400> 86 

catcacgtct ccgaactcc 

<210> 87 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
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<400> 87 

tcccgatggt ctgcagcagc t 



21 



<210> 88 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 88 

gaggcgctca acatgaaatt c 21 

<210> 89 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 



<210> 90 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 90 

cggttgctct gcacttcggc ct 22 

<210> 91 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 



<210> 92 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 92 

gcaaaagcat tgtcccagag a 21 

<210> 93 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 



<400> 89 

gccaggaaca cgaggttctc 



20 



<400> 91 

ccagtggtgg tgatcgttca 



20 
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<400> 93 

cagccaggac aacaatgcga egg 

<210> 94 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 94 

gatggagcag gtggctcagt 

<210> 95 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 95 

agtctggaac atgtcagtct tgatg 

<210> 96 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 96 

cccagagtcc tcagccgcct tcag 

<210> 97 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 97 

tcaccctctg tgacttcatc gt 

<210> 98 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 98 

tgtggttcag gctcttcttc tg 

<210> 99 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> probe 
<400> 99 
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ccctgggaca ccctgagcac ca 

<210> 100 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 100 

ctgctgtctt gggtgcattg 

<210> 101 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 101 

ttgccttgct gctctacctc caeca 

<210> 102 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 102 

gcagcctggg accacttg 

<210> 103 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> forward primer 
<400> 103 

agaggcatcc atgaacttca ca 

<210> 104 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> reverse primer 
<400> 104 

caaactccac agtacttggg ttga 

<210> 105 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> probe 
<400> 105 

egggctgeat cagcacacgc 
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<210> 106 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> forward primer 
<400> 106 

agactgtgga gtttgatgtt gttga 

<210> 107 
<2U> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> reverse primer 
<400> 107 

ggaacaccac caggacctgt aa 

<210> 108 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> probe 
<400> 108 

ttgctgcctc cgcacccttt tct 
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